CDS

Accession Number TCMCG007C10034
gbkey CDS
Protein Id XP_033146534.1
Location complement(join(28460241..28460405,28460506..28460787,28460869..28460946,28461056..28461208,28461283..28461471,28461542..28461818,28461885..28462063,28462146..28462475,28462601..28462657,28462735..28463003,28463145..28463216,28463581..28463875,28463956..28464188,28464290..28464467,28464684..28464761,28464845..28464910,28464994..28465089,28465182..28465277))
Gene LOC103849006
GeneID 103849006
Organism Brassica rapa

Protein

Length 1030 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA249065
db_source XM_033290643.1
Definition protein ALWAYS EARLY 2 isoform X2 [Brassica rapa]
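The Location and Length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the usual feature-table convention that each `start..end` pair is an inclusive, 1-based interval and that the annotated CDS includes the stop codon. The function names `parse_intervals` and `spliced_length` are hypothetical helpers introduced only for illustration.

```python
import re

# Location string copied from the CDS record above.
LOCATION = (
    "complement(join(28460241..28460405,28460506..28460787,"
    "28460869..28460946,28461056..28461208,28461283..28461471,"
    "28461542..28461818,28461885..28462063,28462146..28462475,"
    "28462601..28462657,28462735..28463003,28463145..28463216,"
    "28463581..28463875,28463956..28464188,28464290..28464467,"
    "28464684..28464761,28464845..28464910,28464994..28465089,"
    "28465182..28465277))"
)


def parse_intervals(location):
    """Return (start, end) integer pairs for every 'start..end' segment."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(location):
    """Sum segment lengths, treating both coordinates as inclusive."""
    return sum(end - start + 1 for start, end in parse_intervals(location))


intervals = parse_intervals(LOCATION)
cds_length = spliced_length(LOCATION)

# Assumption: the final codon is a stop codon, so residues = codons - 1.
residues = cds_length // 3 - 1

print(len(intervals), cds_length, residues)  # 18 3093 1030
```

Under these assumptions the 18 segments sum to 3093 nt, that is 1031 codons, which agrees with the stated protein length of 1030 aa plus one stop codon.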

EGGNOG-MAPPER Annotation

COG_category K
Description binding, transcription factor
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001
KEGG_ko ko:K21773
EC -
KEGG_Pathway ko04218, map04218
GOs -

Sequence

CDS:  
ATGGCACCGACGGTTAGGAAGTCGAGGAGTGTGAACAAGCGTTTCACTAATGAACAACCCTCGCCAAAGAGGAGCTCTAGAGAGAATAAGCTGCGTAAGAAATTGTCTGATAAGCTGGGATCTCAATGGACCAAAGCAGAGCTTGAGCGGTTCTATGACTCTTACCGGAAGTACGGACAGGACTGGAGAAAGGTGGCTGCTGCTATTCGGAATAGCAGGAATGTTGAGATGGTGGAAGCTCTCTTTAATATGAATAAGGCATATTTGTCTCTTCCGGAAGGAACTGCCTCTGTAGCTGGCCTTATTGCTATGATGACCGATCATTACAGTGTCATGGAAGGGAGTGGCAGTGAAGGAGAAGGCCCTGATGTTTCAGAAACACCGAAGAAAGAGAAAAAGCGCAAACGTGCAAAGCCTCAGCTTAGTGATTCTCGAGAGGAAGTTGATAGAGATCATCCAGTTGCGTCCTCCACTGACGGATGTCTCAAGTTTTTGAAGCAAGCACGAGGTAATGGAACTCATCGACGTGCCACTGGCAAACGTACACCTCGTGTTCCTGTACAGACTTCACGGGATGATGGGGAAGGCTCTACTCCACCAAATAAAAGAGCCAGAAAGCAACAACGTGATGCCAATGATGATGTTGAGCGTTATTTAGAGTTAGCATTAATAGAAGCATCCAGAAGGGGAGGAGGGTCTCCAAAAGACCTCAGCGACAACTCACCAATAAAGAACTGGGAGAAAATGTCACGGACGAGGAAAGCTCAATCATGGGTGGGAAGTAGCCGAGAAAAGAAGCGTGAATCTGATATGGAAGAGGTTGGGGAAATGGAGGTTCCACGGAAGGGGAAAAGGGTCTACAAGAAGAGAGTAAAAGTCGAAGAAGCAGAGGGTGATTCTTCTGATGACAACGGAGGAGCAAGCAGTGCTACTGAGGGGCTCAGAGTTAAATCAAAGAGACGAAAGGCTGGTCGTGAAGCCTCAAGAGGGACATATTCACCGCGCAGCCCAAAGAACATAGATAACAAACTTACTTCCGGAGATGAATTTGATGCTCTGCAAGCTTTAGCTGAATTATCAGCTTCATTTCTTCCTTCAGCATTGATGGAATCAGAATCATCTCCTCAGGTGAAGGAAGAGAGAATAGAAAACGACATGGACGAGAAACCTAGCTCACCGGAAGCTACCACGTCCACCAGCAGTCATGGGGAAAAAGCAAATTCAGAACCAGATGAGAGTCTGCTACATGCAATCTCTGCTATTGGGAATGCTGTTTACAATAGAAAACCAAAACCTTCAACGCAAGCTTCAACTGATTGTAATGCTGGGAAGCTACAGCCGGAACCTACTAGTGCTAGTTTAAGAAGAAAACGCAAACCAAAGAAGCTAGGAGATGAATCACCACCTGATTCTTCTCAGAACAAATCCATAAACAAAAAGGAGTTAGCTCAAGAAAACCATAATATGAAGTCCTATCTTAGAACAAAACGCACTGGTCAAGGTCCCTCTCAGTCAAAACAGTTGAAAACTGCTAAGGAGTTGGAGGAATCTACTACAATGAGCGATAAGAAACATTCTGCTATGGATGTAGTAGTGTCAACTAAACAAGATTCTGATTCATGTCCAGCCACTTCACCACCACAGAAACCTCCAAACAGGCGTAAGGCGAGTCTGAAGAAAAGCTTACAAGAAAGAGCTAAATCTTCTGAAACCGTTCATAAAGTTCCTCGTAGTTCCAGATCTCTTTCAGAACAGGAGTTGTTATTAAAGGATGAGCTTTCTACTTATATGTCGTATCCCTTGGCACGTCGAAGGTGCATATTTGAGTGGTTTTATAGTGCTATCGACCATCCCTGGTTTGCAAAGATGGTGTTCGTCGATTACTTAAATCACGTGGGACTTGGTCACGTTCCAAGACTCACTCGTCTTGAATGGAGTGTCATTAAAAGCTCTCTTGGTAGAGCTCGAAGGTTCTCTGAGAGATTCTTACAGGAAGAGAG
GGAGAAACTCAAGCAGTACCGTGAGTCTGTGAGAAAGCATTACACAGAGCTTCGAACTGGTGCTAGGGAAGGGCTTCCTACAGATTTGGCTCGGCCATTAGCAGTTGGTAACAGAGTCATTGCCATCCATCCCATAACACGAGAGATTCATGATGGGAAAATTCTCACTGTTGACCATAATCAATGCAATGTTCTGTTCGATGACTTGGGCGTTGAGTTAGTTAAGGACATTGATTGCATGCCTTCAAATCCATTGGAATACATGCCAGAAGGTCTAAGGAGGCAGATTGATAAGTGTTTATCCATGAAGAAAGAAGCACAACAAAATGGGAATCCAAACCTTGGTTTATCTGCTATTTTCCCTCCATATGGACTTGAAAATGCTGACTGTTCCATGAGTCATTCTCTGAATCAGGGTGATATGAATGCTCCTATTCTGCATGGTAAAGTATCAACCGACACTAGTATCCCACATCAGACTAATCAGTCATGTATCATAGATTATAGCAAAGGACGAGAAGCTGAGATTCAGCGAGCACTTGCTCTACAGCATGCTTTAGATGAAAAGGAAATGGAGCCAGAGATGCTAGAAATTGTCAAGGTCTCAAAGACAAGAGCGAAAGCAATGGTGGATGCAGCTATTACGGCTGCATCATCTGTGAAGGAAGGAGAAGATGCCATCAAAATGATCCAAGAAGCCTTAGACATGATTGGCAAACATCAGCCGTTACGCAGCTCTATAGTAGTCAAACAGGAAGAGAACGCAAGTGGCAGCATTGAGCATCATCATCACAACCCATCTCCCTCAGACGCATCAAAGCCTATGGCTAACAACGATTCGATCTCACAAAATGGTTCAGAGAAAAAAGAGGCTCAAATGCCTTCAGAGTTAATCACGTCCTGTGTTGCCACTTGGATCATGATTCAGATGTGCACGGAGAGGCAGTACCCTCCAGCTGATGTAGCGCAGCTTATGGACACAGCAGTCACAAGCTTGCAGCCTCGATGCCCCCAGAATCTACCGATCTACAGAGAAATCCAAATGTGTATGGGACGAATCAAGACTCAAATCATGTCTCTAGTACCAAGTTGA
Protein:  
MAPTVRKSRSVNKRFTNEQPSPKRSSRENKLRKKLSDKLGSQWTKAELERFYDSYRKYGQDWRKVAAAIRNSRNVEMVEALFNMNKAYLSLPEGTASVAGLIAMMTDHYSVMEGSGSEGEGPDVSETPKKEKKRKRAKPQLSDSREEVDRDHPVASSTDGCLKFLKQARGNGTHRRATGKRTPRVPVQTSRDDGEGSTPPNKRARKQQRDANDDVERYLELALIEASRRGGGSPKDLSDNSPIKNWEKMSRTRKAQSWVGSSREKKRESDMEEVGEMEVPRKGKRVYKKRVKVEEAEGDSSDDNGGASSATEGLRVKSKRRKAGREASRGTYSPRSPKNIDNKLTSGDEFDALQALAELSASFLPSALMESESSPQVKEERIENDMDEKPSSPEATTSTSSHGEKANSEPDESLLHAISAIGNAVYNRKPKPSTQASTDCNAGKLQPEPTSASLRRKRKPKKLGDESPPDSSQNKSINKKELAQENHNMKSYLRTKRTGQGPSQSKQLKTAKELEESTTMSDKKHSAMDVVVSTKQDSDSCPATSPPQKPPNRRKASLKKSLQERAKSSETVHKVPRSSRSLSEQELLLKDELSTYMSYPLARRRCIFEWFYSAIDHPWFAKMVFVDYLNHVGLGHVPRLTRLEWSVIKSSLGRARRFSERFLQEEREKLKQYRESVRKHYTELRTGAREGLPTDLARPLAVGNRVIAIHPITREIHDGKILTVDHNQCNVLFDDLGVELVKDIDCMPSNPLEYMPEGLRRQIDKCLSMKKEAQQNGNPNLGLSAIFPPYGLENADCSMSHSLNQGDMNAPILHGKVSTDTSIPHQTNQSCIIDYSKGREAEIQRALALQHALDEKEMEPEMLEIVKVSKTRAKAMVDAAITAASSVKEGEDAIKMIQEALDMIGKHQPLRSSIVVKQEENASGSIEHHHHNPSPSDASKPMANNDSISQNGSEKKEAQMPSELITSCVATWIMIQMCTERQYPPADVAQLMDTAVTSLQPRCPQNLPIYREIQMCMGRIKTQIMSLVPS